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Abstract 


A  model  for  the  stochastic  analysis  of  directed  acyclic  graphs  is  developed.  These  graphs  represent 
event-precedence  networks  where  the  distribution  function  associated  with  an  event  is  assumed  to  be  a 
variant  of  the  phase-type  distribution.  Events  may  occur  sequentially,  probabilistically,  or  concurrently. 
The  distribution  function  of  the  graph  execution  time  is  computed  in  a  semi-symbolic  form.  Applications 
of  the  model  for  the  evaluation  of  concurrent  program  execution  time  and  to  the  reliability  analysis  of 
fault-tolerant  systems  are  discussed. 

1.  INTRODUCTION 

Many  interesting  problems  in  the  design  and  analysis  of  multiple  and  distributed  processing  systems 
need  to  be  solved  in  order  to  provide  their  designers  and  users  with  insights  and  tools  for  system  evalua¬ 
tion.  Most  current  work  on  performance  and  reliability  analysis  of  parallel  and  distributed  systems  by 
means  of  analytic  models  may  be  classified  as  either  prognm-eenterei  (transaction-centered)  or  resseree- 
e entered  analysis. 

In  resource-centered  analysis,  system  resources  are  modeled  in  great  detail  while  a  relatively  simple 
model  of  transaction  behavior  is  assumed.  Maikov  and  semi-Markov  models  have  been  used  for  reliability 
and  availability  analysis  [1,2].  Product-form  queueing  networks  have  been  used  to  analyze  the  perfor¬ 
mance  of  computer  systems  [1,3],  communication  networks,  and  computer-communication  networks  [4]. 
The  existence  of  a  product-form  solution  implies  a  relatively  efficient  numerical  procedure  to  obtain  the 
solution.  However,  real  system  behavior  rarely  satisfies  the  necessary  assumptions  of  a  product-form  net¬ 
work.  For  example,  transactions  with  internal  concurrency  will  violate  product-form  [5,0,7].  Many 
authors  have  studied  approximate  and  exact  solution  techniques  for  solving  queueing  models  where  pro¬ 
grams  are  allowed  to  overlap  their  computation  with  their  own  input-output  operations  [6,7]. 

In  program-centered  analysis,  a  relatively  simple  model  of  system  resources  is  assumed  while  the 
characteristics  of  transactions  are  modeled  in  great  detail.  Behavior  of  program  execution  in  the  face  of 
system  failure/repair  [8]  and  software  failure  [9]  has  been  modeled.  If  we  consider  performance  analysis  in 
the  absence  of  failures  and  assume  that  transactions  possess  internal  concurrency,  then  transactions  can  be 
represented  by  precedence  graphs  [22].  Deterministic  analysis  of  task  precedence  graphs  and  the  schedul¬ 
ing  of  these  graphs  is  known  to  be  important  [22].  Adding  randomness  to  such  graphs,  we  obtain  stochas¬ 
tic  activity  networks  [10,11].  At  least  three  approaches  to  the  analysis  of  such  networks  can  be  identified: 


Markov  chain  techniques,  stochastic  Petri  net  techniques,  and  path  analysis. 

The  first  approach  is  to  express  the  activity  in  the  form  of  a  continuous-time  Markov  chain  {12,13]. 
This  approach  restricts  the  node  times  to  be  exponentially  distributed,  and  also  quickly  leads  to  an  explo¬ 
sion  in  the  state-space  of  the  Markov  chain. 

The  second  approach  uses  Petri  net  models.  Ramamoorthy  and  Ho  use  this  approach  in  the  case 
that  node  times  are  deterministic  (14].  Molloy  considers  exponentially  distributed  node  times  and  con¬ 
verts  the  Petri  net  into  a  Markov  chain  for  analysis  [15].  We  have  allowed  the  node  times  to  be  generally 
distributed  in  our  Extended  Stochastic  Petri  Net  (ESPN)  Model  [16].  Whenever  possible,  the  ESPN  is 
automatically  converted  into  a  Markov  chain  or  a  semi-Markov  process.  If  neither  of  these  approaches 
succeeds  then  the  ESPN  is  evaluated  using  Monte-Cario  simulation.  The  first  two  approaches  for  the 
solution  of  an  ESPN  can  lead  to  large  state  spaces,  while  die  third  approach  can  be  time  consuming  due 
to  the  inherent  speed  limitations  of  a  simulation  model. 

The  approach  developed  in  this  paper  falls  into  the  path  analysis  category.  The  path  analysis  tech¬ 
nique  first  computes  the  distribution  of  the  time  to  traverse  each  path.  For  complex  graphs  the  number 
of  paths  can  be  rather  large,  making  the  technique  computationally  expensive.  In  the  general  case,  over¬ 
lapping  paths  exist  and  hence  one  can  only  obtain  an  approximation  (or  bounds)  for  the  overall  execution 
time  [11], 

If  the  shape  of  the  graph  is  restricted  to  series-parallel,  the  overall  execution  time  can  be  obtained 
exactly.  This  is  the  approach  taken  by  Robinson  [17]  and  Kleinoder  [18]  in  using  directed  graphs  for  the 
performance  analysis  of  concurrent  programs.  We  also  do  this,  but  our  model  allows  for  multiple  paths  to 
be  interpreted  in  a  variety  of  manners,  not  just  as  concurrent  program  execution.  Therefore,  our  graphs 
can  be  used  to  model  reliability  as  well  as  program  execution.  Klein  Oder’s  approach  differs  from  ours  in 
that  he  performs  numerical  convolutions  and  other  such  operations  on  empirical  distributions.  Thus  his 
approach  avoids  any  distributional  assumptions.  However,  our  approach  yields  results  in  semi-symbolic 
form  and  is  faster. 

In  this  paper  we  consider  the  analysis  of  node-activity  networks  that  are  series-parallel  graphs. 
With  each  node  in  the  graph  is  associated  a  distribution  that  has  exponential  polynomial  form.  This  form 


it  quite  general,  and  includes  Neuts'  phase-type  distributions.  The  division  of  a  graph  into  parallel  sub¬ 
graphs  can  be  interpreted  as  ether  probabilistic  or  deterministic.  In  the  deterministic  ease,  the  time 
needed  to  traverse  aD  of  the  subgraphs  may  be  either  the  maximum  or  minimum  of  the  time  needed  to 
traverse  the  individual  subgraphs.  The  distribution  function  of  the  total  time  to  traverse  the  graph  is  stu¬ 
died.  A  program  called  SPADE  [Series -£AarsUel  directed  acyclic  graph  £vaieetor )  has  been  written  to 
compute  this  distribution.  Applications  of  the  use  of  our  model  include  performance  analysis  of  con¬ 
current  programs  and  reliability  analysis  of  non- repairable  fault-tolerant  systems. 

In  section  2,  we  discuss  the  model.  Section  3  describes  the  analysis  of  the  model  and  die  SPADE 
program.  In  section  4  we  give  examples  illustrating  the  use  of  our  approach.  The  examples  chosen  are 
very  simple,  for  the  purpose  of  exposition.  More  complex  problems  can  be  and  have  been  solved  by  the 
model. 

2.  THE  SPADE  MODEL 

The  SPADE  model  consists  of  a  series-parallel  acyclic  directed  graph  with  the  nodes  representing 
events  and  the  edges  representing  a  precedence  relation  between  the  events.  Such  graphs  are  useful  for 
modeling  many  different  kinds  of  activities.  The  two  main  applications  are  performance  and  reliability 
analysis.  In  this  section  we  describe  the  SPADE  model.  First  we  give  a  performance  model  example  and 
a  reliability  model  example,  to  illustrate  how  directed  graphs  are  used  as  models.  Then  we  define  series- 
parallel  graphs  and  describe  how  they  are  interpreted  by  SPADE. 

2-1.  Graph  Model  Examples 

As  an  example  of  a  performance  model,  we  consider  the  process  communication  graph  from  Kung’s 
thesis  [13],  shown  in  figure  1.  There  are  four  tasks  to  be  executed.  Tasks  1  and  2  are  executed  on  one 
processor  and  tasks  3  and  4  on  another  processor.  Tasks  2  and  3  require  results  from  task  1  and  task  4 
requires  results  from  tasks  2  and  3.  Once  task  1  has  completed,  tasks  2  and  3  may  be  executed  at  the 
same  time.  When  they  are  both  finished,  task  4  may  be  executed.  Because  tasks  1  and  3  and  tasks  2  and 
4  are  assigned  to  different  processors,  data  must  be  communicated  between  processors.  The  communica¬ 
tion  time  between  tasks  1  and  3  and  tasks  2  and  4  is  modeled  by  the  nodes  5 13  and  S The  execution 


time  for  the  a  task  is  exponentially  distributed  with  parameter  X.  The  execution  time  for  a  data  com¬ 
munication  node  is  also  exponentially  distributed  with  parameter  «  *  X. 

If  the  model  is  evaluated  using  the  standard  Markov  approach,  the  corresponding  Markov  chain  will 
have  ten  states.  Subsequently,  the  Markov  chain  would  be  solved  by  any  number  of  known  techniques. 
SPADE  avoids  the  state  space  expansion  by  analysing  the  model  directly.  To  further  contrast  the  two 
approaches,  the  SPADE  approach  allows  a  much  more  general  distribution  type  than  exponential,  but 
puts  restrictions  on  types  of  graphs.  Markov  models  allow  cycles  and  general  graphs  but  suffer  from  large 
state  spaces,  particularly  if  we  wish  to  model  non-exponential  behavior. 

To  see  how  a  directed  graph  can  be  used  to  model  reliability,  consider  the  system  of  components 
pictured  in  figure  2a.  The  system  consists  of  three  parallel  subsystems,  and  functions  if  any  one  of  the 
subsystems  is  working.  Each  subsystem  is  composed  of  a  series  of  components,  all  of  which  must  function 
in  order  for  the  subsystem  to  work. 

Previous  approaches  to  solving  such  reliability  problems  include  combinatorial  analysis  and  Markov 
ehains(l].  Combinatorial  analysis  works  well  for  "pure*  series-parallel  systems,  but  if  we  allow  standby 
redundancy  as  well  the  combinatorial  approach  quickly  becomes  intractable.  The  Markov  chain  approach 
suffers  from  the  curse  of  dimensionality;  for  an  n-component  system,  the  Maikov  chain  contains  2"  states. 
The  Markov  chain  approach  also  makes  restrictive  assumptions  on  distributions.  The  SPADE  approach 
appears  to  overcome  both  of  these  problems.  It  should  be  mentioned,  however,  that  the  Markov  chain 
approach  does  not  impose  the  structural  restrictions  and  independence  assumptions  of  our  approach. 

To  analyse  the  model  using  SPADE,  the  model  would  be  transformed  to  the  graph  shown  in  figure 
2b,  which  represents  the  time  to  failure  of  the  system.  For  a  series  of  components,  we  must  take  the 
minimum  of  the  probability  functions  of  the  individual  components  (the  series  fails  as  soon  as  one  com¬ 
ponent  fails).  For  subsystems  in  parallel,  we  must  take  the  maximum  of  the  probability  functions  (the 
system  fails  only  when  all  parallel  subsystems  fail). 


]J.  Sri—  Parallel  Graphs 


The  graphs  in  the  two  examples  in  the  previous  section  are  simple  examples  of  the  class  of  series- 
parallel  graphs.  There  are  several  nearly  equivalent  definitions  for  the  term  'series-parallel*  [20]  [21} [25] ; 
we  define  the  term  as  follows.  A  finite  linear  graph  is  defined  to  be  a  quadruple  G  — (N ,E  ,S ,T)  where 

o  N  is  a  finite  set  of  elements  called  nodes 

o  E  is  a  subset  of  N  x  N ,  called  the  set  of  edges 

o  S  is  the  subset  of  N  containing  thoee  nodes  that  are  not  the  second  member  of  any  edge  in  E 
(these  are  the  entrance  nodes). 

o  T  is  the  subset  of  N  containing  those  nodes  that  are  not  the  first  member  of  any  edge  in  E  (these 
are  the  exit  nodes). 


Suppose  Gi—(Nlt  Ei,  Sit  Tx)  and  <?a“ {Na,  Ea,  5*,  Ta)  are  nonintersecting  graphs.  A  graph 
G “(AT,  E ,  S ,  T)  is  the  series  connection  of  <?t  and  Gt  if  and  only  if 

o  At  least  one  of  T ,  and  Sa  contains  exactly  one  node. 

o  W-N.uNj 

o  £-£iU£jU(r,xS,) 

o  5 —Si,  r-r2 

A  graph  G  is  the  parallel  combination  of  G  i  and  G3  if  and  only  if 
o  N—Ni  U  Na 
o  U  E3 


o  S— S,  U  S2,  T— T,  U  Ta 


The  clan  of  series-parallel  graphs  is  the  smallest  elan  of  graphs  containing  the  unit  graphs  (graphs 
consisting  of  one  node)  and  having  the  property  that  whenever  G  is  the  series  or  parallel  connection  of 
two  graphs  in  the  clan,  then  G  is  in  the  clan.  A  series-parallel  graph  is  by  definition  acyclic  and  contains 
no  redundant  edges. 

Figure  3  shows  an  example  of  a  series-parallel  graph  and  the  sequence  of  series  and  parallel  combina¬ 
tions  from  which  the  graph  can  be  built.  Figure  4  shows  two  graphs  which  are  not  series-parallel. 

2.3.  Graph  Interpretation 

Each  node  in  a  graph  represents  an  event  whose  length  is  specified  by  a  cumulative  distribution 
function  (CDF).  Given  a  graph,  SPADE  will  compute  the  distribution  function  for  the  time  taken  to 
"traverse”  the  entire  graph.  The  definition  of  what  it  means  to  "traverse*  a  graph  G  is  recursive. 

If  G  consists  of  a  single  node,  the  traversal  time  is  given  by  the  CDF  associated  with  that  node.  If 
G  was  formed  by  combining  the  subgraphs  G  i  and  G2  in  series,  then  in  order  to  traverse  G,  we  must  first 
traverse  G ,  and  then  traverse  G2  If  G  is  the  parallel  combination  of  G ,  and  <7S,  we  allow  the  parallel¬ 
ism  to  be  interpreted  in  one  of  three  ways. 
prob*bilutic 

Only  one  of  the  subgraphs  is  actually  traversed.  Each  subgraph  has  associated  with  it  the  proba¬ 
bility  that  it  is  chosen  for  traversal.  G  has  been  traversed  when  one  of  the  subgraphs  has  been 
traversed, 
msrimsm 

The  two  subgraphs  are  traversed  concurrently.  Traversal  of  G  is  complete  when  traversal  of  both 
G i  and  G2  is  complete, 
mtnimsm 

The  two  subgraphs  are  traversed  concurrently.  Traversal  of  G  is  complete  when  traversal  of  the 
first  subgraph  to  to  finish  is  complete. 

Suppose  we  are  modeling  program  execution.  Graphs  with  only  probabilistic  parallelism  will  model 
flowcharts  of  loop-free  sequential  program*.  If  we  only  allow  maximum  concurrency  then  the  graphs  will 
correspond  to  the  task  precedence  graphs  considered  in  [22].  Minimum  concurrent  subgraphs  will  model 


(he  parallel  execution  of  a  non-deterministie  algorithm  [23]  in  which  the  verification  of  all  guessed  solu¬ 
tions  is  attempted  concurrently,  and  the  first  guess  to  be  verified  provides  a  solution  to  the  whole  prob¬ 
lem. 

The  graphs  can  also  be  used  to  model  the  lifetime  of  closed  (non- repairable)  fault-tolerant  systems 
with  permanent  faults.  Such  systems  are  defined  in  [24],  where  they  are  analyzed  by  Markov  chain  tech¬ 
niques.  A  system  consisting  of  a  series  combination  of  components  is  modeled  by  parallel  graph  nodes 
with  type  minimum;  a  parallel  combination  is  modeled  by  parallel  graph  nodes  with  type  maximum.  We 
should  note  that  our  graphs  allow  more  general  distributions  of  subsystem  or  component  lifetimes  than 
those  allowed  by  the  Markov  chain  techniques  used  in  [24]. 

3.  GRAPH  ANALYSIS 

The  analysis  of  a  SPADE  model  has  two  phases.  First,  the  graph  is  decomposed  into  a  binary  tree. 
Then  the  tree  is  used  to  obtain  the  CDF  for  the  graph  traversal  time.  In  this  section  we  describe  the  two 
phases. 

3.1.  True  Decomposition 

Any  series-parallel  graph  can  be  decomposed  into  a  binary  tree,  where  the  internal  nodes  of  the  tree 
are  of  type  ’series”  or  ’parallel”,  and  the  leaves  of  the  tree  are  the  nodes  of  the  graph.  Valdes,  Tarjan 
and  Lawler  [20]  present  an  elegant  algorithm  for  performing  the  decomposition;  the  algorithm  presented 
here  is  less  elegant  but  simpler. 

Suppose  we  have  a  graph  <7.  Let  S  “  {xt,z3,  •  •  •  }  be  the  set  of  entrance  nodes  in  <7.  For 

any  node  x ,  define  A  [x )  to  be  the  set  consisting  of  x  itself  and  all  descendants  of  x .  Define  B  to  be  the 
set  D  A  lx ).  Thus  B  consists  of  those  nodes  in  G  which  are  commonly  descended  from  every  entrance 

•  €S 

node.  We  have  the  following  lemma: 

Lemma: 

Suppose  G  is  the  series  combination  of  G  (  and  G  2,  and  G ,  is  the  parallel  combination  of  two  sub¬ 
graphs.  Then  the  graph  <7S  must  have  a  single  entrance  node,  and  that  node  is  the  first  node  in  B . 


%  definition,  once  G  x  and  £»a  were  combined  in  series,  either  Gx  has  a  single  exit  node  or  G3  has  a 
angle  entrance  node.  <7}  has  multiple  exit  nodes,  because  it  has  parallel,  hence  disjoint  subgraphs, 
each  of  which  has  at  least  one  exit  node.  Therefore  G’2  has  a  single  entrance  node  y . 

When  Gx  and  f7a  were  combined,  all  of  the  exit  nodes  in  G  x  were  connected  to  jr .  Every  entrance 
node  in  £7)  is  either  itself  an  exit  node  of  <7j,  or  has  some  exit  node  as  its  descendent.  Therefore  y 
is  descended  from  every  entrance  node  of  Gx,  and  hence  y  is  in  B .  Furthermore,  because  £7,  has 
disjoint  subgraphs,  no  node  in  G  x  is  descended  from  every  entrance  node  of  G  lt  hence  no  node  in 
G ,  is  in  B .  Therefore  y  is  the  first  node  in  B  . 

Armed  with  this  lemma,  we  proceed  to  describe  a  recursive  decomposition  algorithm  for  series  paral¬ 
lel  graphs.  For  any  graph  G ,  let  Tq  be  the  tree  which  represents  the  decomposition  of  G .  There  are 
four  cases. 
ease  l:  G  «**  {x  } 

Tg  consists  of  the  single  node  x . 
cue  2:  |  G  |  >  1,  5  — ■  {x  } 

T0  consists  of  a  root  node  of  type  ’aeries*  with  left  subtree  the  single  node  x  and  right  sub¬ 
tree  the  tree  Ta 

ease  3:  |  G  |  >  1,  |  5  |  >  1,£— 0 

Since  G  has  more  than  one  entrance  node  and  the  entrance  nodes  have  no  common  descen¬ 
dent,  G  can  be  divided  into  disjoint  parallel  subgraphs.  Let  GX~*G  n  A  (zx)  and 
<7a  — =  G  -  A  (xj).  Note  that  A  (x,)  need  not  be  a  subset  of  G ,  since  G  may  be  some  sub¬ 
graph  which  contains  xj  but  not  all  of  its  descendents.  For  each  of  x2,  •  •  ■  ,x. ,  if 
Gx  H  A  (z{  then  set  Gx  <?j  (J  A  (x,- )  and  Gs  *=  <7a  -  A  (x,- ).  Then  To  consists  of  a 

root  node  of  type  ’parallel”  with  left  subtree  Tqx  and  right  subtree  T0a- 
ease  4:  |  G  |  >1,  |  S  |  >  1,  B  +Q> 

Since  B  is  not  empty,  we  cannot  have  a  parallel  decomposition,  and  hence  must  have  a  series 
decomposition.  Let  Gx  and  (7a  be  two  series  subgraphs  of  G  such  that  Gx  is  minimal.  G , 
cannot  consist  of  a  single  node,  since  then  we  would  have  |  5  |  »  1.  Therefore,  Gx  must  be 
composed  of  subgraphs,  and  those  subgraphs  must  be  parallel,  otherwise  Gx  would  not  be 


minimal.  Now  we  have  the  conditions  of  the  lemma,  and  hence  the  smallest  node  in  £  is  the 
single  entrance  node  of  C2.  Let  y  be  that  node.  Then-  Gj  =  G  -  A  (y )  and 
G2  »  G  n  A  (y ).  Tg  consists  of  a  root  node  of  type  "series”  with  left  subtree  TQi  and  right 
subtree  Tg. 

Figure  5  illustrates  the  decomposition  process  by  showing  an  example  of  cases  2,  3  and  4.  Figure  6  shows 
a  series-parallel  graph  and  the  complete  binary  tree  associated  with  it.  Note  that  for  SPADE  the  parallel 
nodes  are  divided  into  three  types:  "maximum”,  "minimum",  and  "probabilistic”. 

The  tree  decomposition  is  not  necessarily  unique,  but  all  possible  decompositions  of  a  graph  are 
equivalent  in  the  sense  that  the  probability  distributions  as  computed  by  all  of  the  possible  binary  trees 
are  the  same.  This  is  true  because  "maximum*  and  *  minimum”  are  associative,  and  for  the  probabilistic 
nodes,  multiplication  is  distributive  over  addition. 

3.2.  The  CDF  of  a  Tree 

Given  a  series-parallel  directed  graph  and  a  CDF  for  the  traversal  time  of  each  node,  one  can  calcu¬ 
late  the  CDF  of  the  traversal  time  for  the  entire  graph  by  using  the  graph’s  decomposition  tree.  For  a 
tree  node  A  ,  let  FA  be  the  CDF  of  the  time  needed  to  traverse  the  tree  rooted  at  A  .  The  calculation  for 
Fa  depends  on  the  type  of  node  A  . 

If  A  is  a  leaf,  then  A  represents  an  event  node  in  the  series-parallel  graph.  FA  is  simply  the  CDF 
specified  for  that  node.  Now  suppose  A  is  not  a  leaf,  and  that  its  two  subtrees  are  B  and  C .  If  A  is  a 
series  node,  then  A  represents  the  traversal  of  the  subtree  B  followed  by  the  subtree  C .  FA  is  given  by 
« 

1)  fs(0  =  pv  (*)Fc(t-z)rfx 

o 

If  A  is  a  maximum  node,  then  A  represents  the  maximum  of  the  traversal  times  of  subtrees  B  and  C ; 


2)  FA(t)~FB(t)Fc(t) 

If  A  is  a  minimum  node,  then  A  represents  the  minimum  of  the  execution  times  of  subtrees  B  and  C ; 
therefore 
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S)  F4(l)-/i(l)  +  Fe(«)-/»(l)M0 

Finally,  if  A  is  probabilistic,  then  only  one  subtree  of  A  will  be  executed.  Suppose  the  probability  that 
subtree  B  is  executed  is  Pg  and  the  probability  that  subtree  C  is  executed  is  pc .  Then  FA  is  given  by 

*)  FaU)  *  Pb^bIO  +  PcFc{*) 

It  would  be  possible,  given  any  series-parallel  graph,  to  compute  numerically  the  value  of  the  CDF 
of  the  entire  associated  tree  given  any  value  of  I .  If  the  type  of  the  CDF’s  is  restricted  to  be  of  exponen¬ 
tial  polynomial  form  and  the  parameters  are  given,  then  the  overall  CDF  will  also  be  an  exponential  poly¬ 
nomial,  and  can  be  computed  symbolically  in  terms  of  t .  An  exponential  polynomial  is  defined  to  be  an 
expression  of  the  form 

i 

Restricting  distributions  to  this  form  is  a  rather  weak  restriction,  since  it  includes  exponential,  hyperex¬ 
ponential,  Erlang  and  mixtures  of  Erlang  distributions,  in  addition  to  Neuts’  phase-type  distribution. 

Exponential  polynomials  are  closed  under  the  operations  of  addition,  subtraction,  multiplication, 
differentiation  and  integration.  Because  exponential  polynomials  are  closed  under  these  operations,  a 
series-parallel  graph  whose  nodes  have  exponential  polynomials  for  CDF’s  will  have  an  overall  CDF  which 
is  also  an  exponential  polynomial. 

341.  The  SPADE  Program 

The  process  of  finding  the  overall  CDF  of  a  series-parallel  graph  whose  nodes  have  CDF’s  which  are 
exponential  polynomials  can  be  automated.  The  program  SPADE  accepts  a  specification  of  such  a  graph 
and  produces  the  overall  CDF,  mean  and  variance,  and  computes  the  overall  CDF  for  a  specified  range  of 
values. 

In  order  to  specify  a  model  for  SPADE,  a  user  must  supply,  either  interactively  or  in  a  file,  four 
kinds  of  information:  the  graph  edges,  the  parallelism  types  for  parallel  subgraphs,  the  distribution  for 
each  graph  node,  and  a  set  of  intervals  over  which  to  evaluate  the  overall  graph  CDF. 

Because  our  definition  of  series-parallel  requires  subgraphs  combined  in  series  to  begin  or  end  with  a 
single  node,  every  group  of  parallel  subgraphs  is  preceded  by  some  single  node,  except  when  the  overall 


graph  has  multiple  entrance  nodes.  In  that  case,  SPADE  supplies  a  dummy  entrance  node.  Thus,  to 
specify  the  type  of  parallelism  for  parallel  subgraphs,  the  user  assigns  an  "exit  type*  to  each  node  that  has 
more  than  one  immediate  successor.  The  three  choices  for  exit  type  are  maximum,  minimum  or  proba¬ 
bilistic.  If  the  exit  type  is  probabilistic,  the  user  must  specify  the  probabilities. 

Every  node  in  the  graph  must  be  assigned  a  cumulative  distribution  function.  Each  CDF  must  have 
exponential  polynomial  form.  In  general,  for  each  CDF  the  user  must  provide  the  three  parameters  a,- ,  1, 
and  6,-  for  each  term  in  the  exponential  polynomial.  For  the  user’s  convenience,  SPADE  provides  an 
abbreviation  for  the  exponential  distribution,  so  that  the  user  need  only  supply  the  parameter  (inverse  of 
the  mean)  of  the  distribution. 

SPADE  also  allows  a  node  to  have  distribution  "sero",  meaning  that  the  traversal  time  for  the  node 
is  sero.  These  "sera"  nodes  are  useful  for  representing  synchronisation,  and  to  form  graphs  which  would 
otherwise  not  adhere  to  series-parallel  form.  The  examples  in  section  4  include  instances  where  sero  nodes 
are  used. 

Once  all  information  about  the  graph  has  been  obtained,  the  program  forma  a  binary  tree 
corresponding  to  the  series-parallel  graph.  The  graph  decomposition  results  in  a  preorder  representation 
of  the  tree.  The  program  then  backs  up  the  preorder  representation  (traversing  the  tree  in  "reverse  pre¬ 
order”),  computing  the  CDF  of  each  subtree.  Once  the  overall  CDF  is  known,  SPADE  evaluates  the  CDF 
over  intervals  specified  by  the  user. 

4.  APPLICATIONS 

This  section  presents  examples  of  applications  of  the  task  graphs  defined  in  section  2.  The  applica¬ 
tions  are  of  two  kinds:  analysis  of  concurrent  task  execution  and  reliability  analysis. 

4.1.  Concurrent  Program  Execution  Time 


Example  1.  Consider  a  sequential  program  which  consists  of  two  phases.  The  outcome  of  the  first  phase 
determines  which  of  two  alternative  tasks  is  executed  as  the  second  phase.  Thus,  the  node  representing 
the  first  phase  has  a  probabilistic  exit.  This  program  is  illustrated  in  figure  7a. 


The  execution  time  of  task  1  (the  first  phase)  is  exponentially  distributed  with  parameter  Xj  —  0.6. 
The  execution  time  of  task  2  is  2-stage  Erlang  distributed  with  parameter  Xj  “  1  (the  CDF  is  l-e~*  -te~* ) 
and  the  execution  time  of  task  3  is  exponentially  distributed  with  parameter  Xs  —  0.8.  The  probability 


that  task  2  will  be  executed  is  p  *  0.9  while  the  corresponding  probability  for  task  3  is  1  -  p  . 

A  specification  of  the  model  is  shown  in  figure  7b.  The  edges  of  the  graph  are  specified  by  the  two 
node  pairs  (1,2)  and  (1,3).  The  exit  type  for  node  1  is  specified  as  probabilistic,  meaning  that  the  two 
parallel  subgraphs  {2}  and  (3)  are  probabilistic.  The  distributions  for  nodes  1  and  3  are  given  as 
exponential,  with  appropriate  parameters.  The  distribution  for  node  2  is  given  by  specifying  three 
exponential  polynomial  terms.  SPADE  is  asked  to  compute  the  overall  CDF  for  values  of  t  between  2 
and  10,  at  intervals  of  2. 

The  results  given  by  SPADE  are  shown  in  figure  7c. 

Example  t.  Next  we  consider  the  producer-consumer  problem.  This  problem  is  also  explored  in  [19],  in  the 
context  of  real-tune  execution  analysis.  The  producer  process  produces  messages  according  to  an  exponen¬ 
tial  distribution  with  rate  p.  The  consumer  process  consumes  the  messages  according  to  an  exponential 
distribution  with  rate  X.  We  are  interested  in  how  long  it  takes  for  n  messages  to  be  produced  and  con¬ 
sumed. 

The  process  of  the  production  and  consumption  of  two  messages  is  shown  by  the  graph  in  figure  8a. 
The  nodes  Pi  and  P2  represent  the  production  of  the  first  and  second  messages;  Cl  and  C2  represent  the 
consumption  of  the  messages.  Obviously,  each  message  cannot  be  consumed  until  it  is  produced,  but  it  is 
possible  for  message  one  to  be  consumed  while  message  two  is  being  produced. 

With  p  =  .5  and  X  ”=  .3,  we  have  the  input  file  and  results  in  figures  8b  and  8c,  respectively. 
Example  3.  We  evaluate  one  iteration  of  the  program  with  CPU-I/0  overlap  considered  by  Towsley, 
Cbandy  and  Browne  [7]  and  shown  in  figure  9a.  In  each  iteration  of  the  program,  there  are  two  stages. 
The  first  stage  is  always  a  CPU  bunt.  The  second  stage  consists  of  either  pure  input/output,  or 
input/output  that  may  be  overlapped  with  a  second  CPU  burst.  The  probability  that  the  second  stage 
consists  of  CPU-I/O  overlap  is  given  by  p.  Note  the  use  of  a  *iero”  node.  This  allows  us  to  have  one 
branch  of  the  CPUl  node  lead  to  a  single  node,  while  the  other  branch  leads  to  a  group  of  nodes  to  be 
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executed  in  parallel. 

Assuming  /ii=0.125,  pj* 0.0376,  X— 0.0217,  and  p  >>>0.6,  the  results  from  SPADE  are  given  in  figure 
8b.  Note  that  in  SPADE  the  distributions  could  have  been  more  general. 

Example  4 ■  In  this  example  we  consider  the  process  communication  graph  [13]  discussed  in  section  2.1  and 
pictured  in  figure  1.  Assuming  that  X— 0.125  and  s  -»10,  the  distribution  function  and  the  mean  and 
variance  of  the  overall  execution  time  computed  by  SPADE  are  in  figure  10. 

42.  Reliability  Analysis 

We  now  consider  a  sequence  of  examples  of  the  use  of  SPADE  for  reliability  analysis. 

Example  5.  Consider  the  series-parallel  system  discussed  in  section  2.1  and  illustrated  in  figures  2a  and 
2b.  The  system  can  be  specified  for  SPADE  as  in  figure  11a.  Note  the  use  of  *sero”  nodes.  The  results 
are  given  in  figure  lib. 

Example  6.  We  consider  a  triple  modular  redundant  (TMR)  system.  Such  a  system  consists  of  three  com¬ 
ponents,  any  two  of  which  must  be  functioning  in  order  for  the  system  to  work.  A  task  graph -for  the  life¬ 
time  of  this  system  can  be  derived  based  on  example  3.24  of  [l]  and  is  shown  in  figure  12a.  Nodes  2,  3, 
and  4  combined  in  'minimum”  parallelism  represent  the  time  to  failure  of  the  first  component  to  fail. 
Nodes  6  and  7  represent  the  time  to  failure  of  the  second  component  to  fail.  In  this  case  the  distributions 
must  be  exponential,  since  we  rely  on  the  memoryless  property  of  the  exponential  distribution  when 
modeling  the  system  in  this  manner.  The  solution  from  SPADE  is  given  in  figure  12b. 

It  is  possible  to  simplify  the  task  graph  for  this  example  as  shown  in  figure  12c  (based  on  example 
3.26  of  [l]).  The  results  are,  of  course,  the  same. 

5.  CONCLUSION 

A  model  for  the  performance  and  reliability  analysis  using  directed  acyclic  graphs  is  discussed  here. 
The  attractiveness  of  the  model  lies  in  an  efficient  procedure  for  its  evaluation,  potentially  wide  class  of 
applications,  and  a  powerful  class  of  allowed  distributiona.  The  restriction  of  the  model  involves  struc¬ 
tural  restrictions  on  the  kinds  of  models  allowed.  Structural  restrictions  include  an  inability  to  allow 
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cycles  and  the  restriction  to  series-parallel  graphs.  Work  is  currently  under  way  to  extend  the  model  so  as 
to  remove  some  of  the  restrictions. 
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Figure  7a.  Program  with  Probabilistic  Tasks 
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Figure  7b.  Input  for  Example  1 


Figure  7c.  Results  for  Example  1 
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Figure  10.  Results  for  Example  4 
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Figure  12a.  TMR  System 
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